Socrates - A System For Scalable Graph Analytics
Abstract
This paper presents a distributed graph processing system that provides locality control, indexing, graph query, and parallel processing capabilities.
Keywords: graph, distributed, semantic, query, analytics
Related resources
Scalable Analytics over Distributed Time-series Graphs using GoFFish
Graphs are a key form of Big Data, and performing scalable analytics over them is invaluable to many domains. As our ability to collect data grows, there is an emerging class of inter-connected data which accumulates or varies over time, and on which novel analytics – both over the network structure and across the time-variant attribute values – is necessary. We introduce the notion of time-ser...
MOCgraph: Scalable Distributed Graph Processing Using Message Online Computing
Existing distributed graph processing frameworks, e.g., Pregel, Giraph, GPS and GraphLab, mainly exploit main memory to support flexible graph operations for efficiency. Due to the complexity of graph analytics, huge memory space is required especially for those graph analytics that spawn large intermediate results. Existing frameworks may terminate abnormally or degrade performance seriously w...
On Software Infrastructure for Scalable Graph Analytics
Dissertation by Yingyi Bu, Doctor of Philosophy in Computer Science, University of California, Irvine, 2015. Professor Michael J. Carey, Chair. Recently, there has been a growing need for distributed graph processing systems that are capable of gracefully scaling to very large datasets. In the meantime, in real-world applications, it is high...
Graphing trillions of triangles
The increasing size of Big Data is often heralded but how data are transformed and represented is also profoundly important to knowledge discovery, and this is exemplified in Big Graph analytics. Much attention has been placed on the scale of the input graph but the product of a graph algorithm can be many times larger than the input. This is true for many graph problems, such as listing all tr...
GoFFish: A Framework for Distributed Analytics over Timeseries Graphs
Massive datasets from scientific instruments and enterprises were the initial Big Data frontiers. But these are being subsumed by complex, high-velocity data from ubiquitous sensors and social network streams. Such datasets are characterized by both temporal attributes and lateral relationships between them forming a graph structure, and scalable data analytics frameworks have not been adequate...